Skip to content

Conversation

@TejMakode1523
Copy link

No description provided.

@nemesis-55
Copy link

@TejMakode1523 i am planning to train using your code changes to train for 4bit model. If you have already trained and tested. what is the accuracy different between 32 bit and 4 bit model. Also after training i want to push the model to hugging face in compact way. what is the best way to do so . can you help me

@TejMakode1523
Copy link
Author

@nemesis-55 I didn't check the accuracy difference between the 32-bit and 4-bit models yet. I just fine-tuned the 4-bit model using the LoRA method and have an inference script for it, which might help you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants